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Written compositions of 200 second and third grade children were compared 
four times over a 2-year period to determine relationships of composition ability with 
Intelligence and reading level. A global essay instrument developed at the University 
of Georgia was used for the comparisons to find out whether or not the Instrument 
revealed any Improvement In the essays. Four raters scored each essay Independently 
and without knowledge of the other three ratings by following standardized 
procedures. Four mean test scores for each student formed a chronological 
sequence from the fall of 1966 through the spring of 1968. There was a stejady 
Increase In the global essay means of both boys and girls over 2 years. Correlations 
between reading level, vocabulary. Intelligence, and global essay scores showed 
positive relationships. There was not a significant correlation between global essay 
scores and sex or grade level. The study confirmed the sensitivity of the global essay 
Instrument In measuring the growth trend of second and third grade children. (DO) 
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Introduction 

A global essay instrument^ developed at the University of Georgia 
was used to compare written compositions produced at four testing 
dates over a two-year period. The chief purpose was to discover if 
the instrument could be used to reveal improvement in the quality of 
writing produced by the children in the sample group. A second purpose 
was to examine and identify any differences in growth in composition 
ability between the boys and the girls in the sample. Finally, a 
further purpose was to examine the relationships between composition 
ability and intelligence and between composition ability and reading 
level, as ascertained with standardized test instruments. 

From the results of other research, notably Diederich (1966) 
and Wilhelms (1967), one can assume that for large groups of children 
writing does improve from year to year. For the purposes of the 
study, two-hundred children subjects (ono -hundred boys and one- 
hundred girls) in second and third grades were identified and tested 
with the global essay instrument. To control for the problems of 
measuring change, such as fluctuation from testing to testing that 
is not true growth, several writing samples were taken over specific 
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intervals during a two year period. Essays were identified and rated 
from fall 1966, spring 1967, fall 1967, and spring 1968. As recom- 
mended by Diederich (1966), these four essays were used to determine 
a trend of study change in the global essay ratings. 

The Test Instrument 

The product-scale essay instrument with five forms (essay topics) 
was developed from a population of more than 1,000 second and third 
graders.^ This tjrpical product-scale instrument evaluates the 
"product^' (i*e., the child*s wrritten composition) by comparing it 
to other "products" to derive a relative measure of its merit. Actual 
second and third grade writing, produced and selected under standar- 
dized conditions, served as the models for a seven-point scale. 

Rater reliability coefficients for the global essay instrument ranged 
from the mid .60*s to the upper .70*s^as did the reliability coef- 
ficients for equivalent forms. Examinee reliability was checked at 
.70. Validity checks involving the original criteria for the essay 
instrument and teacher estimates of student composition ability 
yielded moderately high coefficients for this kind of test (for 
validity checks by original criteria .70; for validity checks by 
teacher estimates .56) , 

The Sample Population 

Two-hundred boys and girls were identified from the schools 
cooperating with the English Curriculum Study Center at the University 
of Georgia. This group was composed of fifty second grade boys, 

^Test development reported at 1967 and 1968 NCME meeting by 
L, R, Veal. 
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fifty second grade girls, fifty third grade boys, and fifty third 
grade girls. The subjects were followed over a two year period of 
development and tested on four separate occasions (each fall and 
spring for the 1966-67 and 1967-68 school years). The subjects were 
tested further to obtain subject data of several kinds. The subject 
variables examined included sex, grade, chronological age, and test 
scores on reading and mental maturity instruments. 

Third grades received the ’'California Short -Form Test of Mental 
Maturity” (1968 revision with 1965 norms; S-Form, Level IH) , and the 
Reading Test of the "California Achievement Tests” (1957 editions, 
WXyZ series, 1963 norms; Form W, Upper Primary). The second grades 
received the "California Short-Form Test of Mental Maturity” (1963 
revision, 1964 norms; S-Form, Level 1), and the Reading Test of 
the "California Achievement Tests” (1957 edition, WXYZ series, 1963 
norms; Form W, Lower Primary). The means and standard deviations 
for chronological age, reading level and vocabulary scores (G.P.’s) 
from the ’’California Reading Test” and mental maturity score (I.Q.’s 
from the "California Short -Form Test of Mental Maturity” are 
reported in Table I. 

The Rating Procedures 

As each essay was collected and processed, it was scored by 
four trained raters following standardized procedures specified by 
the global essay instrument. Each rater scored the essays indepen- 
dently and without knowledge of the other three ratings. The four 
raters revealed moderately high inter-rater reliability over the 
period of the study (.70). From the four separate ratings, a mean 
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rating was derived for each essay produced on each of the four testing 
dates; consequently, for each of the two-hundred subjects there were 
four mean test scores forming a chronological sequence from fall 1966 
through spring 1968. Means, standard deviations, and standard errors 
for boys and girls for each of the four test dates are reported in 
Table II. The data were analyzed statistically by means of a two 
by four factorial ANOV (Table III) and Duncan* s Multiple Range 
Test (Table IV). 

Findings 

The two by four (sex-by-testing dates) factorial arrangement 
revealed the testing dates to be highly significant. Sex and the 
sex -by-testing dates interaction were non -significant. The means 
of both the boys and the girls for the first testing were signifi- 
cantly different from all subsequent testings, although there were 
no significant differences between the boys and girls on any single 
testing or on overall performance. A non-significant change in 
rate was revealed. There was, however, a steady increase in the 
means of both boys and girls over the four testings during the 
two-year period. The combined means for both boys and girls also 
revealed a steady increase: 3.28, 4.58, 4.69, and 4.82. 

It is interesting to note that the largest increase (from the 
first to the second testing is from 3.28, a, quality rating below 
the mean of the global essay scale (and, consequently, below average 
quality), to 4.58, a quality rating above the mean. After rising 
above the mean, both boys and girls advanced more slowly on the 
quality scale. The tendency to level off above the mean is evident 
in Figure 1. At all testings, the means of the girls were slightly 
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higher than the means of the boys 9 when examined collectively by sex 
or when sub-grouped by grade. 

Additional analysis of subject variables was provided by computing 
product moment correlations between reading level scores 9 vocabulary 
scores of the reading test, intelligence (I.Q.) scores and global 
essay scores. These correlations showed positive relationships 
and are reported in Table V. Reading level correlated with essay 
scores significantly (.65), as did intelligence (.48). The highest 
correlation between a subject variable and essay scores was revealed 
between global essay ratings and the vocabulary section of the 
reading test (. 68 ). Vocabulary has been considered a major compon- 
ent of composition skill in the elementary graaes by other researchers. 
The data in this study support the findings of McLean (1964), Carlson 
(1963), Howell (1956), and others, namely, in that the children who 
scored high on the vocabulary measure employed in this study also 
tended to score high on the global essay instrument. 

Chronological age showed little relationship, and that negative, 
to global essay scores (Table V). While age is considered to be 
associated with composition quality generally, it should be noted 
that these four groups had extremely limited ranges. The overall 
mean of the sample for both grades was 97.74 months with a standard 
deviation of 7.66. These children were in the second or third 
grade classes on the basis of common age and/or maturity as dis- 
cerned within their respective schools. This may explain, in part, 
the lack of a significant correlation for any group when examined 
by sex and grade level. However, the correlation of chronological 
age and global essay quality for the entire sample became slightly 
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higher { , 22 ) ^ when the range was expanded to include both second and 
third grade boys and girls. 

Summary 

The growth study revealed a steady increase in global essay 
means for the written compositions produced by the children on four 
testing dates (fall I 9663 spring 1967g fall 19679 s'nd spring 1968). 
This finding is in agreement with Diederich (1966) and Wilhelms 
(1967) who have reported that growth can be measured for large groups 
from year to year. It is also a confirmation of the sensitivity 
of the global essay instrument in measuring the growth trend of 
second and third grade children. Although the mean essay ratings 
for girls were slightly higher than for boys consistently for all 
the testings 9 both boys and girls showed significant growth during 
the two year period. Consequently 9 it can be concluded that 9 for 
the children of this study g the rate of growth was not related to 

sex. 



Edleann Biesbrock 
University of Minnesota 
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Los Angeles, 1969 
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ANOV FOR SEX AND TEST DATES 
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Source 


df 


SS 




MS 


F 




Total 


399 


574,21 










Treatments 


7 


168,66 










Sex 


1 


,55 




,55 


.71 




Testings 


3 


167,80 




55,93 


71.71** 




S X T 


3 


.31 




.10 


.13 




Error 


392 


305.55 




,78 






** Significant 


at the ,01 level. 












TABLE IV 










COMPARISON 


OF ESSAY MEANS 


BY GRADE AND 


SEX 




Comparisons % 


B1 


Gi B2 


G2 


B3 


G3 B4 


G4 


Means s 


3,16 


3,29 4.53 

1 


4.58 


4.58 


4.71 4.87 


4.88 

1 

1 






^%1eans not underscored by the same line are significantly 
different as determined by Duncan ^s Multiple Range Test at 



the .01 level. 
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FIGURE 1 

GROWTH MEANS FOR BOYS AND GIRLS 
FOR FOUR TESTING DATES 
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